Nullification of the Nullification 
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A recent claim by Meyer, Kent and Clifton (MKC), that their models "nullify" the Kochen- 
Specker theorem, has attracted much comment. In this paper we present a new counter-argument, 
based on the fact that a classical measurement reveals, not simply a pre-existing value, but pre- 
existing classical information. In the MKC models measurements do not generally reveal pre-existing 
classical information. Consequently, the Kochen-Specker theorem is not nullified. We go on to prove 
a generalized version of the Kochen-Specker theorem, applying to non-ideal quantum measurements. 
The theorem was inspired by the work of Simon et al and Larsson (SBZL). However, there is a 
loophole in SBZL's argument, which means that their result is invalid (operational non-contextuality 
is not inconsistent with the empirical predictions of quantum mechanics). Our treatment resolves 
this difficulty. We conclude by discussing the question, whether the MKC models can reproduce the 
empirical predictions of quantum mechanics. 



I. INTRODUCTION 



Prior to the work of Meyer |ffl], the Kochen-Specker theorem (KS theorem) was generally regarded as one of 

the key foundational results of quantum mechanics, establishing one of the most important ways in which quantum 
| mechanics enforces a radical departure from the assumptions of classical physics. Classically, a measurement is a 
process which reveals the pre-existing value of the observable measured. Until recently it was generally accepted that 
' the KS theorem shows that quantum mechanics cannot be interpreted in such a way as to preserve this feature of 
<^> classical physics. 

Unfortunately, the KS theorem, as originally formulated, only applies to ideal, or von Neumann measurement 
processes. In practice, strict ideality is seldom, and perhaps never actually attainable. It follows that the KS 
• theorem, in its original form, is not sufficient to demonstrate a contradiction between classical assumptions and the 
f> i empirically verifiable predictions of quantum mechanics, regarding the outcome of real, laboratory measurements. 

At first sight, this may not seem a serious problem. The natural assumption would be that the original KS theorem 
is a limiting case of a more general theorem, which does apply to non-ideal measurements. We will eventually argue 
that this is, in fact, the correct assumption. However, the question is greatly complicated by the work of Meyer [Q, 
Kent 0] and Clifton and Kent || (MKC in the sequel) , who have argued that "finite precision measurement nullifies 
the Kochen-Specker theorem" . 

MKC's claimed "nullification" of the KS theorem has given rise to some controversy. It has been discussed by (in 
chronological order) Cabello @, Havlicek et al @, Mermin @, Appleby Simon et al p7[ , Larsson [ jl8| , 

Simon |nj , Cabello |2(J and Boyle and Schafir J2lj] . For further discussion of the problem of demonstrating an empirical 
contradiction with the predictions of non-contextual hidden variables theories see Cabello and Garcia- Alcaine p2[ , 
Basu et al |p3[ , Cabello J24|, Michler et al p5[ , Simon et al ]2(| and Cereceda (^7). Finally, it should be noted that 
MKC's argument was inspired by the previous work of Pitowsky 0, Hales and Straus (lO) and Godsil and Zaks pd| ] 
(we briefly comment on Pitowsky 's work in the conclusion). 

The arguments in this paper are largely new. They go significantly beyond any that have previously been given. 
We begin by nullifying MKC's claimed nullification. That is, we show that MKC's ingenious mathematical construc- 
tions, though deeply interesting in their own right, do not invalidate the essential physical point of the KS theorem. 
Having cleared the conceptual ground, we then go on to establish a generalized version of the KS theorem, which 
does apply to non-ideal measurements. In order to make the treatment comprehensive we conclude by discussing a 
number of related questions. In particular, we discuss a recent claim by Cabello |20|, that the MKC models do not 
reproduce the empirically verifiable predictions of quantum mechanics^ 
Our main criticism of MKC's argument is contained in Sections O- 
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Section || contains some preliminary considerations. MKC assume that an experimenter can never precisely know 
what has "actually" been measured. This assumption plays an important role in their argument because it enables 
them to postulate that it is physically impossible to measure each of the observables in a KS-uncolourable set. We 
show that the assumption depends on some misconceptions regarding non-ideal quantum measurements. 

Section III contains the crux of the critical part of our argument. The physical point of the KS theorem is to 
show that quantum measurements cannot generally be interpreted as measurements in the classical sense. A classical 
measurement is not simply a process which reveals a pre-existing value. Rather, it is a process which reveals a pre- 
existing piece of classical information, represented by a proposition of the form "observable A took value x" . For this 
condition to be satisfied it is essential that the value x and observable A both be adequately specified. It appears to 
MKC that their models nullify the KS theorem because, in their models, a measurement does always reveal the value of 
something. However, it does not reveal the precise identity of that something. Since the MKC valuations are radically 
discontinuous this means that measurements do not generally reveal any classical information. Consequently, they 
cannot generally be regarded as measurements in the classical sense. It follows that the KS theorem is not nullified. 



Section IV contains some supplementary considerations, which reinforce the conclusion reached in Section III 



The argument just outlined goes beyond the arguments given in our previous papers because it shows that MKC's 
claim, to have nullified the KS theorem, is sirm 3 ly false. In Appleby we showed that the MKC models are 

highly non-classical. In fact, we showed in ref. that the MKC models exhibit a novel kind of contextuality, which 
is even more strikingly at variance with classical assumptions than the usual kind of contextuality, featuring in the 
KS theorem. Consequently, the argument in ref. is, by itself, sufficient to refute MKC's suggestion, that their 
models provide a classical explanation of non-relativistic quantum mechanics. The argument in ref. |l5[ ], though less 
clear-cut, lends additional support to this conclusion. However, neither argument directly bears on MKC's claim, 
to have nullified the KS theorem as such. Indeed, at the time we wrote these papers it appeared to us that MKC 
actually had "nullified the KS theorem strictly so-called" (as we put it). 

It is important to establish that finite precision does unequivocally not nullify the KS theorem because, in the 
absence of such a demonstration, it is impossible to achieve an unobstructed understanding of contextuality, as it 
applies in a real, laboratory setting. The question has immediate, practical relevance, in view of the current interest 
in experimental investigations of contexuality |l7|-[i1| 22|- p7 | . 

Having cleared the conceptual ground in Sectionsjn |IV| we go on, in Section |v|, to formulate and prove a generalized 
KS theorem, applying to non-ideal measurements. The argument in this section is inspired by the argument of Simon et 
al [[l7||ig| l and Larsson |l8j (SBZL in the sequel). However, there are some important differences. SBZL work with an 
"operational" concept of contextuality. Their motive for introducing this concept is to circumvent MKC's argument. 
Clearly, the motive no longer applies, once it is established that MKC's argument is invalid. We are consequently able 
to formulate a generalized KS theorem in terms of the ordinary concept of contextuality. This has several advantages. 
In the first place it means that the result we prove is a straightforward generalization of the ordinary KS theorem. 
In the second place, there is a loophole in SBZL's argument, which means that their result is actually incorrect. As 
we show in Appendix by means of a counter-example, operational non-contextuality is not inconsistent with the 
empirical predictions of quantum mechanics. The modified argument we give in Section |v| resolves this difficulty. 
Lastly, the result we prove implies a significantly different conclusion, as to the conditions which must be satisfied in 
order refute non-contextual theories experimentally. 

Sections pnV contain the main part of our argument. In Sections Vl-VIH we address a number of related questions. 

In Section~|V] we briefly discuss the POVM arguments given by Kent \ f\ a nd Clifton and Kent Q . 

In Section VII we give an improved version of the argument in Appleby Jl 5[| . In the first place we have improved the 
argument so as to take account of the points made in Section [l| of this paper. In the second place, the version we give 
now does not involve joint measurements of non-commuting observables. In the third place, we have strengthened 
the argument, using ideas derived from the subsequent work of SBZL. 

In Section VIII we discuss the question, whether there actually exists a complete theory of MKC type which is 
empirically equivalent to quantum mechanics. In particular, we discuss a recent claim by Cabello [Q, that theories of 
this type make experimentally testable predictions which conflict with those of quantum mechanics. We argue that, 
although Cabello makes some very pertinent points, the question remains open. 



II. WHAT IS "ACTUALLY" MEASURED? 



MKC's argument partly depends on a misconception, regarding non-ideal quantum measurements. We begin our 
critical discussion by clarifiying this point. The criticisms which follow are preliminary to our main critical argument, 
contained in the next section. 
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MKC postulate that there are many observables which it is physically impossible to measure. If an experimenter 
attempts to measure an observable A in this forbidden set, then what is actually measured is a slightly different 
observable B. The measurement reveals the pre-existing value of the observable B which is actually measured. 

It can be seen from this that MKC tacitly assume: 

Property 1 For a given finite precision measuring apparatus there is a single, uniquely defined observable, 
which is the only observable that is "actually" measured. 

Property 2 The nominal observable, which is recorded in the experimenter's notebook as having been 
measured, is typically not the observable which is "actually" measured. 

We will show that these assumptions are unjustified. 

Before proceeding further, let us clarify the meaning of the term "nominal observable", as it appears in the above 
statement. The term essentially corresponds to Simon et al's |l7j "switch position". However, unlike Simon et al we 
are not appealing to the concept of an "operational observable" (i.e. the concept that different instruments define 
different observables — see Appendix Q). We are simply recognizing the fact that the complete record of a measurement 
(quantum or classical) must include, not only a specification of the value obtained, but also a specification of the 
observable measured. Suppose, for instance, that an experimenter makes 1,000 different measurements of 1,000 
different observables. If the experimenter simply writes down a list of 1,000 numbers, without any indication as 
to how those numbers were obtained, then the record will obviously be useless (this point, that the outcome of a 
measurement [quantum or classical] is, not simply a value, but a determinate proposition, will play a central role in 
the argument of the next section). 

It should be stressed that, although we refer, for convenience, to the actions of a human experimenter, the same 
point would apply if the measurements were performed by a completely arbitrary "information gathering and utilizing 
system" ]28fl . If the system is to utilize the information it acquires then it must record, in its memory area, a 
specification of the observables measured, as well as the values obtained. 

The nominal observable need not be specified precisely. Rather than recording the information "observable S z was 
measured and value +1/2 obtained", one might instead record the information "observable n • S was measured, for 
n € U, and value +1/2 obtained" (where U is some subset of the unit 2-sphere). In the following we will assume 
that the nominal observable is specified precisely, because this is legitimate, and because it happens to be the usual 
practice. However, the argument could easily be modified so as to allow for the possibility that the nominal observable 
is only specified partially. 

Let us now consider MKC's assumption that, for each quantum measurement, there is one, and only one observable 
which is "actually" measured. The fact that this assumption is unjustified becomes apparent when one takes into 
view (as MKC do not) the detailed physical implementation of an abstract quantum measurement scheme. 




FIG. 1. Schematic illustration of the Stern-Gerlach arrangement. The surface irregularities of the pole pieces, and asym- 
metries in the environment, mean that the field is not perfectly symmetric. There is no single direction which represents the 
alignment of the field more truly than any other. It follows that there is no single spin component which is measured more 
truly than any other. 

Consider, for example, a non-ideal measurement of the spin component n • S using a Stern-Gerlach apparatus. The 
vector n is determined by the axis of symmetry of the arrangement. MKC take it that, in such a case, although the axis 
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is not precisely known to the experimenter, there does actually exist a single, sharply defined axis of symmetry; and 
that there correspondingly exists a single, sharply defined spin component n • S which is the only spin component that 
is actually measured. However, this is to overlook the fact that the magnetic field will not, in practice, be perfectly 
symmetric (due to irregularities in the shape of the pole-pieces, asymmetries in the placement of surrounding objects, 
etc.). As illustrated in Fig. ^ the numerous slight departures from perfect symmetry introduce some unavoidable 
"fuzziness" into the concept "axis of symmetry of the field" . Under these conditions there is no single vector n which 
represents the axis of symmetry more truly than any other. This being so, there are no obvious grounds for picking 
out any single spin component as the only component which is "actually" measured. 

A measurement pfj{ of n - S is a process which discriminates the eigenstates of n- S. Given an unknown eigenstate of 
n-S, an ideal measuring instrument will inform the experimenter, with certainty, which particular eigenstate it was. A 
non-ideal measuring instrument will inform the experimenter, with high probability, which particular eigenstate it was. 
This is the definition of a quantum measurement process p0| |: if a process performs the function, of discriminating 
the eigenstates of an operator, then it is a measurement of that operator, ideal or non-ideal as the case may be. 

Of course, it is seldom, if ever the case that a real laboratory instrument is strictly ideal, and the Stern-Gcrlach 
arrangement is no exception to this rule. It is important, furthermore, to note that, not only does a Stern-Gerlach 
arrangement fail to perform an ideal measurement of the nominal spin component, which is recorded in the experi- 
menter's notebook as having been measured; it fails to perform an ideal measurement of any other spin component 
either (for a discussion of the unavoidable sources of non- ideality in Stern-Gerlach measurements see Busch et al |3lf ) . 
The arrangement may, however, be used to perform non-ideal measurements (which is the most that can reasonably 
be demanded of a real, laboratory instrument). 

We can characterize the degree of non-ideality in quantitative terms by considering the probability that the process 
will fail to correctly identify a given eigenstate. Suppose that the arrangement is used to measure the component 

n • S for a spin-1/2 particle. Let p_| (n) (respectively p h( n )) be the probability that, when the system particle is 

initially in the spin up (respectively spin-down) eigenstate of n • S the measurement outcome is —1/2 (respectively 
+1/2). Then it may be said that, the smaller these probabilities are, the more nearly ideal |3^] the measurement. 

There is no vector n for which p^ (n) = p |_(n) = (see Busch et al [j3l| ). There are, however, vectors n for 

which the failure probabilities p^ (n), p ^(n) are both small. If n satisfies this condition the arrangement can be 

used to discriminate the eigenstates of n • S with a high degree of reliability. In other words, it can be used to perform 
non-ideal measurements of n • S. 

The failure probabilities p^ (n), p |_(n) vary continuously with n. This means that, if the probabilities are small 

for one vector n, then they will also be small for every other vector which is close to n. It follows that, for a given 
arrangement, there are infinitely many different spin components which are all non-ideally measured. Furthermore the 
class of observables which are non-ideally measured includes the nominal observable (provided it is correctly recorded, 
and provided the instrument does not malfunction). 

We began our discussion of the Stern-Gerlach arrangement by noting that, in practice, the arrangement will not have 
a sharply defined axis of symmetry. It can now be seen that, even if the symmetry were exact, MKC's assumption, 
that there is only one observable that is "actually" measured, would still not be justified. It is true that there might 
(perhaps) then be a well-defined, natural sense in which the measurement of n • S was "most nearly ideal" when n 
was parallel to the axis of symmetry. But it would not follow that a neighbouring component n' • S, defined by a 
vector n' which was not exactly parallel to the axis of symmetry, was not "actually" measured at all. 

A process does not have to be precisely optimal in order to count as a measurement. If the probabilities p^ (n) 

and p |_(n) become slightly larger, then that simply means that the process does not discriminate eigenstates quite 

so efficiently as before. It does not mean that the process thereby ceases to be a measurement. 

The real photon detectors which have been constructed to date are imperfect, in that there is a non-zero probability 
that the detector will fail to register the presence of a photon. This does not mean that a real photon detector does 
not "actually" detect photons at all. Furthermore, a photon detector whose performance becomes slightly degraded, 
so that it is no longer quite so efficient as before, does not, on that account, cease to be a photon detector. 

In general, however, there is no unambiguous, non-arbitrary, physically well-motivated way to define the concept 
"unique spin component for which the measurement is most nearly ideal" . For instance, minimizing the function 

(1/2) (n) + p h( n )); an( i minimizing the function ((1/2) (p H (n) 2 +p ^(n) 2 )) 1 2 leads to definitions of this 

concept which are, in general, incompatible (quite apart from the fact that the minima may not be unique). In the 
general case there are many spin components which all have an equally valid claim to the status "spin component for 
which the measurement is most nearly ideal" . 

Thus far we have been considering the case of the Stern-Gerlach apparatus. However, it is shown in Appendix ^ 
that the above criticisms apply much more generally, to the case of any approximate von Neumann measurement 
(i.e. any non-ideal measurement described by a unitary operator which is close to a unitary operator describing a von 
Neumann measurement). This provides strong grounds for the assertion that Properties 1 and 2 assumed by MKC 
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should be replaced by 

Property 1' For a given finite precision measuring apparatus there arc infinitely many different observables 
which are non-ideally measured. 

Property 2' The nominal observable is among the observables which are non-ideally measured (provided 
the nominal observable is recorded correctly, and provided there is no malfunction in the 
measuring apparatus). 

It may be worth remarking that these statements are also valid classically. Consider, for example a classical 
measurement of the velocity component v z of some macroscopic object, using the Doppler shift of the radiation it 
emits. In order to perform this measurement it is necessary that the line running from source to detector should be 
parallel to the z-axis. The fact that source and detector are both extended objects, which are not perfectly symmetric, 
means that, in practice, this line is not even precisely defined, let alone precisely known. However, this does not imply 
that v z may not be the component which is "actually" measured. Rather, it is to be regarded as one of the sources 
of error in the measurement of v z . A classical measurement of a classical observable A is a process which reveals, 
with a high degree of reliability, and to a high degree of accuracy, the pre-existing value of A. On this definition the 
procedure described effects a non-ideal classical measurement of n • v for every n which is nearly (but perhaps not 
precisely) parallel to the z-axis. 

The effect of the argument in this section is to restore the natural assumption of most physicists, that the observable 
which an experimenter records as having been non-ideally measured is also measured in fact (provided the experimenter 
does not make a mistake, and provided there is no malfunction in the measuring apparatus). 

Let us now consider the bearing which this has on MKC's argument. Our criticisms do not invalidate MKC's 
postulate, that a measurement of A may reveal the value of some other, neighbouring observable B. They do, 
however, show that MKC cannot justifiably claim that B is the only observable which is "actually" measured, and 
that A is not really measured at all. MKC cannot even claim that B is the uniquely defined observable for which the 
measurement is most nearly ideal. In general, the observable B has no preferred status. Properly understood;, MKC's 
postulate comes to this: that the hidden dynamics somehow selects an essentially arbitrary observable B k, A, whose 
value it then reveals. 

The point is important because, once MKC's proposal is formulated correctly, it becomes easier to see the fallacy 
in their argument. 

III. NULLIFICATION OF THE NULLIFICATION 

This section contains the main part of our critical argument. 
The KS theorem shows 

Proposition 1 No hidden variables interpretation can have the property that, for every observable A, an ideal 
measurement of A always reveals the pre-existing value of A. 

MKC's claim, when appropriately re-formulated, so as to remove any unjustified appeal to the concept of "the unique 
observable which is actually measured" , is that this proposition is "nullified" by 

Proposition 2 There do exist hidden variables interpretations with the property that, for every observable 
A, a finite precision measurement of A always reveals the pre-existing value of some unknown 
observable B in a small neighbourhood of A. 

In considering this claim the first problem we face is that the term "nullified" is not part of the standard lexicon of 
science and mathematics, and so its meaning is potentially ambiguous. Propositions 1 and 2 are logically consistent, 
so there can be no question of the KS theorem actually being refuted. It is not immediately apparent what, exactly, 
is meant by the claim that Proposition 2 "nullifies" Proposition 1, without contradicting it. 

In this connection it should be noted that the reference to finite precision measurements, on which MKC themselves 
place much emphasis, is actually irrelevant. MKC postulate that, for each non-ideal measuring apparatus, the hidden 
dynamics somehow arbitrarily selects an observable B, close to the nominal observable, whose value the measurement 
then reveals. One could, with equal justification, postulate that the same is true for each ideal measuring apparatus: 
implying 

Proposition 2' There do exist hidden variables interpretations with the property that, for every observable A, 
an ideal measurement of A always reveals the pre-existing value of some unknown observable 
B in a small neighbourhood of A. 
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If the KS theorem is nullified by Proposition 2, then it must, presumably, also be nullifed by Proposition 2'. It would 
seem to follow that such nullification as there might be cannot be attributed specifically to the finite precision. 

In the following we will take the claim to be that the MKC models invalidate what had previously been regarded 
as the key physical implication of the KS theorem: namely, the implication that quantum measurements cannot 
systematically be interpreted as measurements in the classical sense. In the MKC models a measurement does 
not always reveal the value of the nominal observable. It does, however, always reveal the value of an observable 
which is extremely close to the nominal observable. MKC assume that is enough to recover the classical concept of 
measurement. 

The question we now have to decide is whether MKC are correct to assume that a process which does not reveal the 
pre-existing value of the nominal observable may still count as a measurement in the classical sense. We will argue 
that they are not. 

The crucial point to realize is that a classical measurement is not simply a process which reveals a pre-existing 
value. Rather, it is a process which reveals a pre-existing piece of classical information, represented by a proposition 
of the form "observable A took value x" . For this to be true it is essential that the observable A and the value x both 
be adequately specified. A process which reveals the value of some completely unknown observable does not reveal 
any classical information, and is not a classical measurement. 

The following example may serve to illustrate this point. Suppose that an experimenter is given a sealed box, 
containing 100 individually labelled rods. Suppose that, for each integer n = 1, . . . , 100 the box contains exactly one 
rod of length n cm. Suppose that the experimenter, without looking inside the box, uses a random number generator 
to select a particular integer 1 < n < 100. It could be said that this procedure reveals the length of one of the rods in 
the box. However, it does not reveal the identity of that rod. Consequently, the process does not reveal any classical 
information, and so it cannot be considered a classical measurement. 

It may, at first sight, seem that these remarks do not apply to the situation envisaged by MKC since, in their 
models, the observable B, whose value is revealed by the measurement, is not completely unknown. In fact, it is 
assumed that B is extremely close to the nominal observable A: so there is a sense in which B is specified very 
precisely. However, it is not specified precisely enough. 

The point here is that the MKC valuations are radically discontinuous. If the valuation could be assumed continuous 
in the vicinity of A, then a process which revealed the value of some neighbouring observable B could be regarded 
as a measurement in the classical sense. However, there is in fact always a region on which the valuation is highly 
discontinuous, and so this assumption would not generally be justified. 

Consider, for example, the colourings of the unit 2-sphere S 2 discussed by Meyer |jj (and described in more detail 
by Havlicek et al Jl^]). These colourings have the property that, given any n G S 2 , every neighbourhood of n contains 
infinitely many points evaluating to 0, and infinitely many points evaluating to 1. Suppose, now, that one performs a 
finite precision measurement of the nominal observable (n • S) 2 and obtains (say) the value 1. Then one knows that, 
there is a point n' in some small neighbourhood U of n which evaluates to 1. However, U contains infinitely many 
points evaluating to 1, and infinitely many points evaluating to 0. The process does not reveal any more information, 
as to which particular point takes which particular value, than could be obtained by tossing a coin. Consequently, 
it cannot be considered a classical measurement. It is not a classical measurement for the same reason that random 
number generators cannot be used to make classical measurements of length. 

Essentially the same criticism applies to an arbitrary model of MKC type. In the general case the highly discon- 
tinuous behaviour just discussed may not occur everywhere, on the whole of S* 2 . However, there will always be a 
non-empty open subset of S 2 on which such behaviour occurs. 

The detailed proof of this statement is given in Appendix We will confine ourselves here to summarising the 
main points. 

Consider an arbitrary MKC colouring /: Sq — > {0,1}. Here Sq is any dense, KS-colourable subset of the unit 
2-sphere S 2 having the property that the set of triads contained in Sq is dense in the space of all triads (MKC assume 
that S 2 is also countable; however the argument which follows does not depend on this assumption). 

We define the discontinuity region D C S 2 to consist of those points n 6 S 2 with the property that each neigh- 
bourhood of n contains infinitely many points evaluating to 0, and infinitely many points evaluating to 1 (it is not 
assumed that n itself € S 2 ). We define the continuity region C C S 2 to consist of those points n € S 2 with the 
property that / is continuous on U (1 Sq, for some neighbourhood U of n. In the case of the Meyer colourings C = 
and D = S 2 . However, this is not true generally. 

It is readily verified that these regions partition S 2 (the complete unit 2-sphere, not just Sq) into two disjoint 
subsets: C U D = S 2 and C n D = 0. It is also readily verified that C is open and D is closed. 

The key result, proved in Appendix |b| is that C is not only open, but also KS-colourable. Furthermore, the 
KS-colouring / defined on C H Sq uniquely extends to a continuous, induced KS-colouring f:C — * {0, 1}. 

We show in Appendix [B| that this result places significant constraints on the minimum size of the discontinuity 
region. In the first place, D must have non-empty interior (i.e. it must contain a non-empty open subset — implying that 
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it cannot simply consist of a set of isolated points or lines). In the second place, there is a fixed, model- independent, 
non-zero lower bound on the solid angle subtended by D. 

Let us now consider the bearing of these results on MKC's claim, to have "nullified" the KS-thcorem. The argument 
we gave above (in connection with the Meyer colourings) shows that the /-value assigned to a specific point e D n S 2 
cannot be ascertained by any quantum measurement and is, in this sense, inaccessible. On the other hand the /-value 
assigned to a specific point eCn^ can be ascertained by a quantum measurement, and so it is accessible (provided 
the observable whose value is revealed is sufficiently close to the nominal observable). It is easy to see that the same 
is also true of the /-value assigned to a specific point € C (where / is the induced KS-colouring defined above). 

From an empirical point of view it is not the /-values defined on So which are significant, but the /-values defined 
on C. It is only when n £ C that a quantum measurement of (n • S) 2 can be interpreted as a measurement in the 
classical sense, which reveals the corresponding /-value (provided the observable whose value is revealed is sufficiently 
close to the nominal observable). 

The fact that there exist non-empty, KS-colourable open subsets of S 2 is trivial (consider, for example, the cone 
C = {n e S 2 : \n z \ > l/y/2}). There is no sense in which this fact can validly be said to nullify the KS theorem. 

In conclusion, it can be seen that the finely calculated approach of MKC, where one marks in individual points 
using a pencil, gets no nearer to "nullifying" the KS theorem than does a cruder approach, where one paints in whole 
open regions using a brush. It can also be seen that a would-be nullifier gets no nearer to his/her goal by postulating 
that a measurement reveals the value of an observable which is typically not the nominal observable. 

IV. NULLIFYING MADE EASY 

This section contains some supplementary considerations. Its purpose is to reinforce the point made in the last 
section that, for a model to nullify the KS theorem, rather more is required than the bare fact, that a measurement 
always reveals the pre-existing value of something. 

If the KS theorem could be nullified in the manner that MKC propose, then one could achieve that end more 
straightforwardly, without appealing to MKC's sophisticated set-theoretic constructions, by means of one of the 
following, much less sophisticated models. 

Since it is a hidden variables theory that is in question, Ockham's razor does not apply, and so we are free to 
multiply hypothetical entities at will. Let us accordingly postulate that, corresponding to each observable (n • S) 2 
of the conventional theory, there exist two beables sa(ii) and ss(n). Each beable is assigned the value or 1. We 
postulate that these values fluctuate randomly, and independently, but in such a way that, for each n, at every time, 
exactly one of the beables sa(ii), Ss(n) is assigned the value 0, and exactly one is assigned the value 1. In every other 
respect the model coincides with conventional quantum mechanics. We are thus assuming that the values taken by 
the beables have no causal influence on the outcome of a measurement — or, for that matter, on anything else. Their 
function is simply to exist. We will refer to this as model 1. 

It is trivially true that, in model 1, the effect of measuring (n • S) 2 is to reveal the pre-existing value of one of 
the two quantities Syt(n) or s_e(n). This is to be compared with the situation in the MKC models, where the effect 
is to reveal the value of one out of an infinite set of vectors contained in some small neighbourhood of n. It would 
consequently seem that, if one is to accept that the MKC models nullify the KS theorem, then one must accept that 
model 1 nullifies it equally well. 

Of course, it is intuitively evident that the KS theorem is not really nullified by model 1. However, it is worth 
examining the reason for this. 

Model 1 does not nullify the KS theorem because, even though the measurement reveals a pre-existing value, the 
experimenter does not learn anything thereby. The experimenter already knows, before the measurement, just from 
the way the model is defined, that exactly one of the two quantities s^n) or s_e(n) takes the value 0. The experimenter 
does not know more after the measurement. The procedure cannot be used to acquire additional knowledge concerning 
SA(n), ss(n) and so it is not, in the classical sense, a measurement of these quantities. In short, model 1 does not 
nullify the KS theorem for essentially the same reason that the MKC models do not nullify it. 

It is also illuminating to consider the following, slightly different model, which we will refer to as model 2. Model 2 
is the same as model 1, except that the quantities S/t(n), ss(n) are not assumed to fluctuate. Instead, it is postulated 
that, for each n, s^n) always takes the value 0, and ss(n) always takes the value 1. 

In model 2 a measurement of (n • S) 2 not only reveals the pre-existing value of s^n) or ss(n): the experimenter 
even knows which of the two quantities took this value. Nevertheless, there is still no nullification. Suppose that an 
experimenter measures (n- S) 2 and obtains the value 0. Then, on the assumptions of model 2, the experimenter knows 
that SA(n) took the value 0. However, the experimenter knew this already, before carrying out the measurement. 
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The procedure has not conveyed any additional information, regarding Syi(n), and so it is not, in the classical sense, 
a measurement of sa(h). 

The triviality of the constructions makes it evident that models 1 and 2 do not really nullify the KS theorem. 
However, the underlying fallacy is essentially the same as the fallacy underlying the much more sophisticated arguments 
of MKC. The only difference is that, in the case of the MKC models, partly because the constructions are so much 
more sophisticated, the fallacy is less obvious. 



V. KS THEOREM FOR NON-IDEAL MEASUREMENTS 

In Sections || [fv| we established the negative result: that finite precision does not "nullify" the KS theorem. The 
purpose of this section is to establish the positive result: that the KS theorem has a natural generalization, applying 
to the non-ideal measurements which might actually be performed, in the laboratory. 

The original version of the KS theorem shows 

Proposition 1 No hidden variables interpretation can have the property that, for every observable A, an ideal 
measurement of A always reveals the pre-existing value of A. 

— meaning that ideal quantum measurements cannot systematically be interpreted as ideal classical measurements, 
of the same observables. 

The generalized version of the KS theorem shows 

Proposition 1' No hidden variables interpretation can have the property that, for every observable A, a near- 
ideal measurement of A always has a high probability of revealing a good approximation to the 
pre-existing value of A. 

— meaning that near-ideal quantum measurements cannot systematically be interpreted as near-ideal classical mea- 
surements, of the same observables. 

Proposition 1' is a qualitative statement, since the terms "near-ideal", "high probability" and "good approximation" 
are not sharply defined. In the following we will prove an inequality which gives unambiguous, quantitative expression 
to Proposition 1', as it applies to the standard example of a spin 1 system (see Inequality (^) below). 

Proposition 1' does not exclude the possibility that a near-ideal quantum measurement may reveal the pre-existing 



value of something other than the nominal observable. As we saw in Sections III and IV this possibility has no bearing 
on the essential point of the KS theorem, which is to show that there is a radical difference between the quantum 
mechanical and classical concepts of measurement. 

Proposition 1' does not only exclude the possibility of interpreting quantum measurements as ideal classical mea- 
surements. This is important, since there would otherwise be no conflict with classical assumptions. One does not 
classically expect a real, laboratory measurement to be strictly ideal (in the classical sense). 

The argument which follows was inspired by the work of Simon et al @@ and Larsson @ (SBZL). However, 
we do not use their "operational" definition of contextuality. This makes the relationship with the original KS 
theorem much clearer. Furthermore, there is a loophole in SBZL's argument, which means that the result stated by 
them is actually invalid [operational non-contextuality is not inconsistent with the empirical predictions of quantum 
mechanics). Lastly, the result we prove implies a significantly different conclusion, as to the conditions which must 
be satisfied in order to refute non-contextual theories experimentally. For further discussion of SBZL's approach see 
the end of this section, and Appendix 

Consider the standard case of a spin-1 system, with spin vector S. Consider an apparatus which can be used to 
perform a joint, possibly non-ideal measurement of the observables (ei ■ S) 2 , (e2 • S) 2 , (e3 • S) 2 for any orthonormal 
triad r = {ei, e 2 , e 3 }. 

We are here assuming that it is physically possible to (non-ideally) measure every triad r — contrary to what is 
postulated by MKC. The justification for this assumption is that, if it is physically possible to (non-ideally) measure 
one triad, then it follows from rotational invariance, together with the argument in Section || and Appendix |A| of 
this paper, that it is physically possible to (non-ideally) measure every other triad. Suppose, for instance, that one is 
given an apparatus which (non-ideally) measures the triad r, and one wants to measure the triad r' which is obtained 
by rotating r through the Euler angles 8, 4>, ip. Then one simply rotates the apparatus through these angles. The fact 
that the rotation cannot be performed with infinite precision does not mean that the apparatus may not measure r' 
at all. It only means that the measurement of r' may be less nearly ideal than would otherwise be the case (i.e. it 
may discriminate the eigenstates of r' less reliably than would otherwise be the case). For further discussion of the 
practical realization of such measurements see Swift and Wright [B3[ . 
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Let p q (r, ip) be the probability that a measurement of r results in one of the "illegal" combinations 
000, 001, 010, 100, 111, given that the system was prepared in the state ip. Define 

e q = sup(p q (r, VO) (1) 

The quantity e q partially characterizes the degree of non-ideality of the quantum measuring device (the characteriza- 
tion is only partial because the condition e q = 0, though necessary, is not sufficient for strict ideality). It corresponds 
to SBZL's quantity e (mutatis mutandi). 

Let f\(n) be the pre-existing value of (n ■ S) 2 when the hidden state is A (it should be stressed that f\ is defined 
for every n, and so it is not a KS-colouring). Let p c (r, A) be the probability that, with the system initially in the 
hidden state A, a measurement of r fails to reveal the ^-values of r. Define 

e c = sup(p c (r, A)) (2) 

The quantity e c characterizes the degree of non-ideality of the apparatus regarded as a classical measuring device, 
whose function is to reveal pre-existing values. It should be noted that e c depends on the particular hidden variables 
theory considered. Unlike e q it has no empirical significance. 
We will now prove 

Ne q + e c > 1 (3) 

where N is the smallest number of triads which do not admit a consistent colouring. 

This inequality shows that e q and e c cannot both be made arbitrarily small. It thus gives precise, quantitative 
expression to the qualitative statement of Proposition 1'. 

Proposition 1' is a general statement, so it allows for the possibility that a measurement may reveal a good 
approximation to the pre-existing value, without revealing it exactly. In the case considered here there are only two 
possible values, and so this contingency cannot arise. 

To prove this inequality, consider the probability Pi(r, A) that, when the hidden state is A, a measurement of r gives 
one of the "illegal" results 000, 001, 010, 100, 111. We have, for all tp, 

Pq(r,ip) = / Pi(T,\)dVj, (4) 
J A 

where A is the hidden state space (of the system by itself) , and is the probability measure on A which corresponds 
to the quantum state i/>. 

Now choose a KS-uncolourable set which contains the smallest possible number of distinct orthornormal triads. 
Let n, . . . , tjv be an enumeration of these triads. By construction there exists, for each A, an index rx such that the 
/A-values of t Tx are "illegal" . It is then a straightforward consequence of the definitions that 

1 -Pi( r r A ,A) <p c (T rx ,X) (5) 

implying 

Pi(Tr x , A) > 1 - e c (6) 
for all A. Taking this result in conjunction with Eqs. (|j) and (Q) we deduce 

N , ✓ N 

l{T rx ,X)dui>l-e c (7) 



Ne q > ][> q (Tv,^) = f (X>(T r) A)) dvf > f Pi ( 

r=l J V=l ' J 



— which is the result claimed. 

It is worth remarking that this argument actually establishes the stronger, state-dependent inequality 

Ne q (ip) + e c >l (8) 

where e q (V>) = sup r (p q (r,-0))- 

If the measurement faithfully reveals pre-existing values then Inequality ([?]) implies A^q > 1. This is similar to the 
result claimed by SBZL p7|-[l9| , that A^q > 1 in any theory which is operationally non-contextual (substituting our 
e q for their e). However, it is not the same because a theory which does not faithfully reveal pre-existing values may 
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still be non-contextual in the sense of SBZL's operational definition. SBZL's claim is actually incorrect, since there 
exist operationally non-contextual theories with the property that Ne q = (see Appendix |c|) . 

Simon et al's |0 statement, that there is a conflict with classical assumptions as soon as e q < l/N, also requires 
modification. Suppose that e q is only slightly less than < l/N. Then it is true that the classical failure probability 
cannot be strictly zero. However, it may still be very small — which would be entirely consistent with classical 
assumptions regarding a real laboratory instrument. Indeed, the possibility is not excluded that < e c <C e q < l/N — 
implying that the apparatus actually functions much better when regarded from the classical point of view, as an 
instrument for revealing pre-existing values, than it does when regarded from the quantum point of view, as an 
instrument for discriminating eigenstates. It follows that, in order to demonstrate an experimental conflict with the 
classical concept of measurement, it is not enough simply to construct an apparatus for which e q < l/N . Instead, one 
must reduce e q to the point where the lower bound on e c becomes ~ 1. 

The purpose of this section was to establish the basic point of principle, that the KS theorem does have a valid 
generalization applying to non- ideal measurements. For this purpose it was sufficient to confine ourselves to the single 
example of a spin 1 system. However, it would clearly be interesting to examine other examples. Indeed, we have 
not shown that Inequality (|^) is the only, or even the most useful inequality applying a spin 1 system. One would 
expect there to be many different ways to give quantitative expression to the qualitative principle represented by 
Proposition 1'. This is a question which requires further investigation. 



VI. KENT AND CLIFTON'S POVM ARGUMENT 

In this section we briefly consider the argument of Kent |?J and Clifton and Kent Q to (as they put it) "rule out 
falsifications of non-contextual models based on generalized observables represented by POV measures" . We include 
this section for the sake of completeness. A reader who is so inclined may skip to the next section without loss of 
continuity. 

A POVM (or positive operator valued measure pl| , p4| -|3^] ) is an indexed set of positive operators F\ with the 
property J^. Fj = 1 (following Kent and Clifton we take the set to be finite). Kent and Clifton take the view that to 
each distinct POVM there corresponds a distinct generalized observable. A generalized observable, in effect, simply 
is a POVM. 

Kent and Clifton consider that a non-contextual theory of generalized observables would be one in which generalized 
measurements reveal the pre-existing truth values of the corresponding POVMs. Their argument to show that such 
theories exist is similar to the argument which, they claim, nullifies the ordinary KS theorem. The objection to it is 
also similar. 

Let A be the set of all positive operators. Kent and Clifton postulate that the POVM which is actually measured 
is always contained in a certain dense subset Ad C A, which is colourable in a generalized sense. If F,; is the POVM 
that is actually measured, then the outcome will be the unique index ig for which F\ evaluates to "true" . 



The objection to this argument is essentially the point we have already made, in Sections III and [V. The problem is 
that the experimenter does not know the identity of the positive operator whose truth value is revealed. Suppose that 
an experimenter measures the nominal POVM Pi, and obtains the result iq. Then s/he knows that there is another, 
typically different POVM P! for which p! is "true" . The experimenter knows that something is "true" . However, 
the experimenter does not know what that something is. Consequently, the procedure is not genuinely informative. 
If one is told "proposition p is true" , but is not given any indication what the symbol p denotes, then one has not 
really been told anything. 

Kent and Clifton consider that their POVM argument includes, as a special case, an alternative proof that non-ideal 
measurements of conventional observables can be explained non-contextually pTflj . Let us consider how this might 
work. 

POVMs play an important role in the mathematical description of non-ideal measurements of conventional observ- 
ables . C onsider, for example, the non-ideal measurement of the conventional observable A = a|a)(a| described by 
Eq. (A2) in Appendix [A|. Define 

E a = \a}(a\+J2% a a\a)(b\+J2 e t,aa\b){a\+ ]T e^e^M (b 2 \ (9) 

b b &i,b2,c 

It is readily verified that the operators E a constitute a POVM. Their physical significance is that, if the system is 
initially in the state \tp), then (il)\E a \ip) is the probability that the result of the measurement will be a. 

Let us note, parenthetically, that it may be questioned whether the concept of a generalized observable is appropriate 
in the case of POVMs which arise in this way, as part of the mathematical description of non-ideal measurements of 
conventional observables [115|,K9| . 
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Clifton and Kent consider that, in the case of the measurement process described by Eq. (A2), what is actually 
measured is, not really A at all, but rather the POVM E a . Of course, the unitary operator U in Eq. (A2), and the 
POVM E a defined in terms of it, could not, in practice, be known, with infinite precision. Consequently, Clifton and 
Kent's POVM argument, when specialized to non-ideal measurements of conventional observables, establishes the 
following: there exist models in which the effect of non-idcally measuring an observable A — o.\a)(a\ is to reveal 
the pre-existing truth values of some unknown POVM E a w |a)(a|. Conceived as a way of nullifying the KS theorem 



this result is no more effectual than Proposition 2, which we criticized in Sections III and IV 



VII. CONTEXTUALITY OF SEQUENTIAL ANALYZERS 



In this section we discuss the predictions of the MKC models regarding sequential measurements, using three 
separate analyzers. If the arrangement is near-ideal, then the observable whose value is revealed by one analyzer 
must depend on the settings of the other two analyzers. In other words, it must depend on the overall experimental 
context. 

This result serves to reinforce the conclusion reached earlier, that the MKC models are highly non-classical. It 
also raises the question, whether the MKC models are capable of reproducing the empirical predictions of quantum 
mechanics — as discussed in the next section. 

The argument which follows is an improved version of an argument given in Appleby |jl5| . In the first place we have 
improved the argument so as to take account of the points made in Section [n] of this paper. In the second place, the 
version we give now does not involve joint measurements of non-commuting observables. In the third place, we have 
strengthened the argument, using ideas derived from the subsequent work of SBZL. 
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FIG. 2. Sequential measurement of (e i ■ S) 2 , (e 2 • S) 2 , (e 3 • S) 2 . The system passes through a succession of analyzers, which 
measure each observable in turn. 

Consider sequential measurements of the observables (ei ■ S) 2 , (e 2 ■ S) 2 , (e3 • S) 2 , using the apparatus illustrated in 
Fig. H The apparatus consists of three identical, non-ideal analyzers chained together. S denotes the spin vector of 
a spin 1 system, and {e 1 ,e 2 ,e 3 } is an arbitrary orthonormal triad. The assumption that {e!,e 2 ,e3} can be chosen 
arbitrarily is justified in Section [y|. 

The fact that there may be errors in the alignment of the analyzers is already included in the fact that the analyzers 
are not assumed ideal (see Section |n|). It may therefore be assumed that the nominal triad {ei,e2,e3} is strictly 
orthogonal. 

Suppose that one of these analyzers is used on its own, to measure the single observable (n • SJ . MKC postulate 
that the measurement will reveal the value of some neighbouring vector n' 6 5 2 (where, as usual, Sq denotes the 
dense, KS-colourable set on which the MKC valuations are defined). For each n there is a probability measure /i n 
such that Hn{A) is the probability that n' E A C Sq. 

Now consider the situation when the analyzers are used jointly, to measure (ei -S) 2 , (e 2 -S) 2 , (e3 -S) 2 . Let (e' 1; e 2 , e 3 ) 
be the triad whose values are revealed by the measurement. If the behaviour of each analyzer is independent of the 
context in which it is used, then the probability that (ei,e 2 ,e 3 ) £ A\ x A 2 x A3 must be fi ei (Ai)fi e2 (A 2 )/Lt e3 (A3). 

It can be seen that, if the behaviour of the analyzers is context-independent, then the triad {e'^e'^eg} is not 
guaranteed to be strictly orthogonal (unlike the triad {ei, e 2 , e 3 }, which is orthogonal by assumption). Consequently, 
the measurement outcome is not guaranteed to be one of the "legal" combinations 110, 101, Oil. In Appendix ^ we 
use this to show that, if the analyzers are sufficiently near-ideal, then the assumption of context-independence conflicts 
with the empirical predictions of quantum mechanics. Specifically, we show that context-independence implies 

Ne q > ~ (10) 

where e q is the quantum non-ideality index defined by Eq. (Q), and N is the smallest number of distinct orthonormal 
triads which cannot be consistently coloured. 
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Inequality ( [10| ) shows that, if the analyzers are sufficiently near-ideal, so that e q < 1/(2N), then the observable 
whose value is revealed by one analyzer must depend on the settings of the other two analyzers. This is a form of 
contextuality. 

Inequality ([Io|) was derived on the assumption that a measurement of (n • S) always reveals the pre-existing value 
of n' € Sq. Suppose we make the weaker assumption, that the value of n' is only revealed with probability > 1 — S c . 
Then a similar argument to that given in Appendix [Lj shows 

Ne q + S c >^ (11) 

if the behaviour of the analyzers is context-independent. This inequality should be compared with Inequality (J3j) - 
It should be noted that essentially the same argument goes through in the case where the measurements are made 

non-sequentially. We have chosen to focus on the sequential case because that is where the conflict with classical 

assumptions emerges most clearly. 

Kent p8| has argued that sequential measurements are not joint measurements. According to him a process does 

not count as a joint measurement unless the different observables are measured "all at once" , in parallel so to speak. 

We consider this objection to be unjustified, for the reasons explained in Appendix |j. 



VIII. ARE THE MKC MODELS EMPIRICALLY EQUIVALENT TO QUANTUM MECHANICS? 

The purpose of this section is to discuss the question, whether there actually exists a complete theory of MKC type 
which is empirically equivalent to quantum mechanics. In particular, we will discuss a recent claim by Cabello P0| , 
that theories of MKC type make empirically testable predictions which definitely conflict with those of quantum 
mechanics. Our conclusion will be that, although Cabello makes some very pertinent points, the question remains 
open. 

To begin with it should be noted that, as Clifton and Kent ||] themselves emphasize, the MKC models are, as they 
stand, incomplete, since they do not include any dynamical postulate. The result proved in the last section illustrates 
this point. We showed that, if the analyzers in Fig. ||are sufficiently near-ideal, MKC's assumptions require there to 
be a delicately adjusted "conspiracy" , whereby the behaviour of one analyzer is influenced by the settings of the other 
two analzyers in just that particular manner which is required to secure agreement with the empirical predictions 
of quantum mechanics. In a complete hidden variables theory this conspiracy would be explained in terms of the 
dynamics of the hidden variables describing the system+apparatus+environment composite. However, MKC do not 
consider the dynamics. Consequently, their account does not show that it is actually possible to construct a complete, 
dynamical theory having all the required properties. 

Let V& be the set of projections on which the MKC valuations are defined, and let A be the hidden state space. 
Clifton and Kent Q show that one can associate to each Pgfja function P: A — > {0, 1}, and to each quantum state 
i\) a probability measure such that 

(V|P|V) = / P(A)t&^ (12) 

J A 

Let us call this their statistical theorem. 

In order to obtain a complete hidden variables theory, they would need to supplement this result with a dynamical 
theorem. They would need to show that there exists a one-parameter family of mappings F t : A — > A such that 
F t! ° F t2 = F tl+t2 and 

v^ t =v^o Ff 1 (13) 

for all t, ip (where ipt denotes the Schrodinger picture state into which tj) evolves after a time t). It is a non-trivial 
question, whether such mappings actually exist. 

There are three weaknesses in MKC's argument. MKC's mathematical results — the existence of dense KS-colourable 
sets, and Clifton and Kent's statistical theorem — are clearly formulated, and rigorously proved. However, the physical 
part of their argument, to show that these results have the effect of "nullifying" the KS theorem, is much less clear, 
as Cabello Q and Mermin Q have stressed. The counter-argument we presented in Sections |n|-[lV| was directed at 
this point of weakness. We showed that the physical part of MKC's argument disintegrates when subjected to careful 
analysis. 

The second point of weakness is the lack of any dynamical theorem. If it could be shown that it is definitely not 
possible to define a dynamical evolution satisfying Eq. (|l^), then it would follow that there does not exist a complete 
theory of MKC type which is empirically equivalent to quantum mechanics. 



12 



The third weakness is that MKC only consider joint measurements of commuting observables. In a genuinely non- 
contextual theory there should be a principle of approximate non-contextuality, applying to joint measurements of 
non-commuting observables |15| . 

The argument in Cabello [ [20| , however, is not directed at any of these points of weakness. Instead, Cabello claims 
to have derived a result (his Lemmas 3 and 4) which directly contradicts Clifton and Kent's statistical theorem. There 
are only two possibilities here: either there must be a technical mistake in the proof of Clifton and Kent's statistical 
theorem, or else there must be a technical mistake in the proof of Cabello's Lemmas 3 and 4. It appears to us that 
the mistake is in fact on Cabello's side. 

MKC postulate that the effect of measuring an observable P is to reveal the pre-existing value of some other 
observable P' which is very close to P. Cabello (see footnote 27 in his paper) takes this to imply that a sufficiently 
high precision measurement of P will, with high probability, reveal the pre-existing value of P itself (if P is in the set 
Vd on which the valuation is defined). In other words he assumes (in the notation of Eq. ( |l2[ ) above) that, if P' sa P 
then there is a high probability that P'(A) = P(X). He fails to allow for the fact, which we proved in Section |l| 
and Appendix [b|, that the MKC valuations are radically discontinuous. This means that a knowledge of the value of 
P' does not, in general, convey any information at all regarding the value of P, no matter how small the difference 
P'-P. 

If P' « P then it follows from Eq. ( |l2| ) that J A (P'(X) — P{X)) dv^ w for all -0. However, this does not imply 
J A l-P'(A) — P(A)| du^ « 0, as assumed by Cabello. 

Cabello's argument can, in fact, be turned around, to disprove the assumption on which it depends. It follows from 
Clifton and Kent's statistical theorem that the conclusion to Cabello's Lemma 3 is false. So what Cabello has really 
done is to prove, by reductio ad absurdum, that there are cases where J. \P'{\) — P(X)\ dv^ is not negligible, even 

though P' w P. He has similarly proved that there are cases for which there is a non-negligible probability of two 
nearly orthogonal projectors both evaluating to 1. 

Of course, in a genuinely non-contextual theory, there would be a high probability of P' having the same value 
as P, if P' — P is sufficiently small. The fact that the MKC models do not have this property means that they are 



not non-contextual — as we showed in Section III. However, Cabello's argument is not directed at MKC's claim, that 
their models are non-contextual. It is directed, instead, at MKC's claim, that their models reproduce the empirical 
predictions of quantum mechanics. There is no evident contradiction between this latter claim and the fact that the 
valuations in their models are highly discontinuous. 

It should, however, be stressed that, although it has not been proved that there does not exist a complete theory 
of MKC type which is empirically equivalent to quantum mechanics, the contrary proposition has not been proved 
either. The question, in short, remains open. Furthermore, although Cabello fails to establish his main contention, 
some of his intermediate results are of considerable interest. It is possible that further investigation along these lines 
would show that Cabello's contention is, after all, correct. However, it appears to us that such a programme is unlikely 
to succeed unless it is directed at one of the points of weakness indicated above, rather than at Clifton and Kent's 
statistical theorem (which seems to have been rigorously proved). 

IX. CONCLUSION 

A substantial part of this paper has been devoted to a criticism of MKC. We ought to stress, by way of conclusion, 
that we actually consider that they have made a very important contribution to this subject. We have argued that 
MKC's claim, that their models "nullify" the KS theorem, was premature; in fact, fallacious. However, the models 
themselves are deeply interesting. They confirm the fact that there are depths to the KS theorem which, in the past, 
have not been sufficiently appreciated ("confirm" because this point had, in fact, already been established by the 
constructions of Pitowsky ||). 

MKC take it to be almost self-evident that the existence of dense KS-colourable subsets of S 2 "nullifies" the KS 
theorem. We have argued that they are mistaken, and that there is actually no nullification. However, the mistake is 
far from obvious. MKC's contention seems, at first sight, highly plausible. At least, it seemed highly plausible to the 
present author, before he began to seriously engage with the problem. 

In order to see that the MKC models do not "nullify" the KS theorem it is necessary to appreciate the point made 



in Section III, that a process does not count as a classical measurement unless it reveals, not simply a pre-existing 
value, but a pre-existing piece of classical information. To put it another way, the outcome of a classical measurement 
is, not simply a bare number, but a determinate proposition, which expresses some pre-existing classical fact. It is also 
necessary to appreciate the points made in Section || and Appendix wL regarding non- ideal quantum measurements. 
In previous accounts these points have not been explicitly recognized. They are, however, crucial, as the MKC models 
demonstrate. 
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Although the MKC models do not "nullify" the KS theorem, they do cast important new light on the physical 
implications of the KS theorem. They show that there are subtleties to the physical interpretation of the KS theorem 
which have not, in the past, been sufficiently emphasized. For this reason the work of MKC is valuable. 

Although we have presented our discussion in the form of a criticism it might, in some ways, be more appropriate if 
it was seen, in terms which are less prejudicial, as one more contribution to a line of investigation which Pitowsky 
initiated, and which MKC have continued. It is a line of investigation which might be worth pursuing further. 

Pitowsky's constructions are particularly interesting in this respect, because he challenges traditional assumptions 
at a much more fundamental level. For instance, he challenges the assumption that sets which are not Lebesgue- 
measurable have no physical significance. Pitowsky's idea, that "one can, as it were, reduce physical paradox to a 
mathematical 'pathology' " , is unlikely to lead to a classical explanation of quantum phenomena (a "pathological" 
theory — which is to say, a theory that seems strange as judged by the standards of ordinary intuition — is a non- 
classical theory, almost by definition). However, that does not seem to be Pitowsky's intention. Rather, he seeks to 
gain additional insight, into what quantum mechanics is actually telling us, by considering the problem from a new 
and unusual perspective. It is a deeply interesting idea, which deserves to be more widely known. 
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APPENDIX A: APPROXIMATE VON NEUMAMNN MEASUREMENTS 

In Section || we criticized MKC's concept, of the single observable that is actually measured, for the special case of 
the Stern-Gerlach apparatus. The purpose of this Appendix is to show that our argument generalizes, to the case of 
any approximate von Neumann measurement (i.e. any measurement described by a unitary operator which is close 
to the unitary operator describing a von Neumann measurement). 

Consider, to begin with, a strict von Neumann measurement of the observable A = ^2 a€£ a\a)(a\ (where £ is the 
set of eigenvalues) acting on an n dimensional Hilbert space Ti.. For the sake of simplicity we assume that A is non- 
degenerate. In order to perform the measurement A is coupled to a pointer observable jl = X) a e£ a l a )ptpt(a| acting 
on a different n-dimensional Hilbert space 7i p t (the state space of the measuring instrument). The measurement 
interaction causes the evolution 

U\a) ® |0o) P t = \a) <8> \a) pt (Al) 

for all a, where U is the unitary evolution operator describing the measurement interaction, and |0o)pt is the (fixed) 
initial state of the measuring instrume nt. 

The measurement described by Eq. jAl| ) is ideal because Q 

1. It discriminates eigenstates with perfect reliability (if the system is initially in the eigenstate \a), then the final 
pointer reading will certainly be a) . 

2. The final state correlations are perfect (if the final pointer reading is found to be a, then the final state of the 
system is |o)). 



In practice it would be difficult, if not impossible to construct an apparatus such that U satisfied Eq. (Al) precisely. 
A more realistic model of the kind of measurement which it might actually be possible to perform, in the laboratory, 
would be 

U\a) <8S |<£o} P t = | a) ® |a) P t + ^ e «> 6c ^ ® I c )p* ( A2 ) 

b,ce£ 

where the coefficients e a ,& c are small. 

Of cou rse, the model measurement process described by Eq. flA2|), though more realistic than the one described by 
Eq. ( |Al[ ), is still highly idealized. A real laboratory instrument is a complex macroscopic system having many other 
degrees of freedom apart from the pointer observable. It is also an open system, undergoing non-negligible interactions 



14 



with its environment (these interactions play an important role in accounting for the fact that system+instrument 
are in a mixed state at the end of the measurement process Lastly, it should be noted that in a real laboratory 

measurement the final state correlations are often (though not always) very poor (for instance, in the traditional 
Stern-Gerlach arrangement, where the particle hits a photographic plate, the result of the measurement does not 
contain useful information regarding the final spin state of the particle). However, the model is sufficiently realistic 
for present purposes. 



The unitary operator U in Eq. (A2) is close to the unitary operator describing a von Neumann measurement. In 
other words, it describes an approximate von Neumann measurement. The process is not an ideal measurement of 
A. It is, however, a non-ideal measurement (which is the most that can reasonably be demanded of a real laboratory 
procedure). Specifically: 

1'. It discriminates eigenstates with a high degree of reliability (if the system is initially in the eigenstate |a), then 
there is a high probability that the final pointer reading will approximately be a). 

2'. The final state correlations are good (if the final pointer reading is found to be a then Tr((A — a) 2 pj i=a 0, where 
p is the reduced density matrix describing the final state of the system, after the pointer has been read). 

These properties should be compared with the defining characteristics of an ideal measurement (items 1 and 2 above) . 

We now show that the process also effects a non-ideal measurement of every other observable which is sufficiently 
close to A. 

Choose some continuous parameterization for the ONBs (orthonormal bases) in TC, so that |a, v) is the ONB with 
parameter value v. Let i>o be parameter value corresponding to the original ONB: \a, vq) = \a). Then an arbitrary 
observable B can be expressed in the form 

B = A(u,S) = J2( a + Sa)\a,is)(a,v\ (A3) 
for suitable v,5. If B is close to A then we can choose v as vq and 5«0. 



Eq. (A2) can be re-written in terms of the ONB \a, v): 

U\a, v) ® |0 o )pt = \a, v) (g> \a) pt + ^ e a>bc (v) \b, v) ® |c) pt (A4) 

b,ces 

where the coefficients e a ,bc{y) vary continuously with v. It follows that, if v « vq and S sa 0, then the coefficients 
£a,bc{v) will be small, and so the process will be a non-ideal measurement of A(u, S) (in the sense of the definition 
expressed by items 1' and 2' above). 

We conclude that U describes a non-ideal measurement of B for every B close to A — as claimed. 

MKC assume that, given any non- ideal measurement of A, there will be a neighbouring observable B which is mea- 
sured ideally, and which they take to be the only observable that is "actually" measured. We noted in Section [il] that 
there exists at least one class of non-ideal measurements for which this assumption is invalid: namely, Stern-Gerlach 
measurements. A Stern-Gerlach measurement is not an ideal measurement of anything: not an ideal measurement of 
the nominal observable, which the experimeter records as having been measured, and not an ideal measurement of 
any other observable either [^T) . Measurements of this kind might be described as completely non-ideal. It is easily 
seen that they are, in a certain well-defi ned sense, typical, or generic. 

If the measurement described by Eq. ( |A2|) is of the special kind assumed by MKC then 

U\a, v) ® |0 o )pt = e Wa | a, v) ® |a) pt (A5) 

for some v « Vq and vector 0. We may thus identify the set of all such measurements with a neighbourhood of the 
identity in the group U[n), having dimension n 2 . On the other hand, the space of coefficients e a .bc may be identified 
with a neighbourhood in the coset space U(n 2 )/U{n 2 — n), having dimension n 2 (2n — 1). It follows that, if n > 1, 
the set of measurements assumed by MKC constitute a proper, lower-dimensional submanifold of the manifold of 
all non-ideal measurements. This means that, if one were to select a set of coefficients e a ,bc satisfying the unitarity 
condition at random, then, for many reasonable choices of probability measure, there would be probability zero that 
the measurement which resulted was of the special kind assumed by MKC. In any case, the contrary hypothesis, 
that a randomly chosen laboratory measurement is likely to be of the kind assumed by MKC, cannot be considered 
plausible. 

Suppose, however, that a laboratory measurement of A did happen to be of the very special kind assumed by MKC. 
Then there would be some observable B w A which was measured ideally. However, MKC would still not be justified 
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in arguing that B is the only observable that is "actually" measured. The fact that the measurement of B is slightly 
better than the measurement of A does not imply that A is not measured at all. A measurement does not have to be 
strictly ideal in order to count as a measurement (otherwise one would have to say that a Stern-Gerlach apparatus 
does not measure spin, and that real photon detectors do not detect photons). 



APPENDIX B: LEMMAS CONCERNING THE REGIONS C AND D. 



The purpose of this appendix is to prove those results to which we appealed, without proof, in Section III. For 



ease of reference we state the results as lemmas. The reader should refer to Section III for definitions of notation and 
terminology. 

Lemma 1 The continuity region C is KS- colourable. Furthermore f extends by continuity to a uniquely 
defined continuous KS-colouring f:C—> {0, 1} . 

Let Co (respectively C\) consist of those vectors n G C having a neighbourhood U such that /(n') = (respectively 
/(n') = 1) for all n' G U n Sg. 

Co and C\ partition C into two disjoint (open) subsets. We may therefore define a function /: C — ► {0, 1} by setting 
/(n) = if n e C , and /(n) = 1 if n € C x . 

Now let _{ei, e 2 , e 3 } be any orthonormal triad C C. By construction, there exists a triad {ei,e 2 ,e 3 } C C n S 2 
such that /(e r ) = f(e' r ) for r = 1,2, 3. It follows that {ei, e2, 63} must /-evaluate to one of the "legal" combinations 
Oil, 101, 110. A similar argument shows that two orthogonal vectors S C cannot both /-evaluate to 0, even if their 
vector product ^ C. 

It follows that / is a KS-colouring of the set C. The claim is now immediate. 

Lemma 2 The discontinuity region D has non-empty interior. 

Let {ni,ri2, . . . n#} be any finite KS-uncolourable set. The fact that C (the complement of D) is KS-colourable 
means that at least one of these vectors must S D. Suppose that the labelling is such that, for some r > 1, 
{ni,n 2 , . . . n r } C D and {n r+ i,n r+2 , ■ ■ ■ n fl } C C. 

For each s = 1, . . . , r let e s be the angular distance from n s to C, and for each t = r + 1, . . . , R let 5t be the 
angular distance from n t to D. Define S = min( r _|_i)<t<# (S t ). The fact that C is open means that 5 > (unless C is 
empty — in which case the claim is immediate). 

If it should happen that e s > for some s, then n s is in the interior of D — which means that the interior of D 
is non-empty. Otherwise we can find a suitable rotation, through an angle < 5, which will take one or more of the 
vectors ni, . . . , n r into C, while not taking any of the vectors n r+ i, . . . , n# out of C. After re-labelling this gives us 
a new KS-uncolourable set {n^, n 2 , . . . n' R } with {n' 1; n 2 , . . . n' r ,} C D and {n' r , +1 ,n' r , +2 , . . . n' R } C C for some r' < r. 

If it should still happen that none of the vectors n^, . . . , n' r , is in the interior of D we can repeat the procedure. 

It is impossible to rotate all the vectors into C, so after sufficiently many iterations of the procedure at least one of 
the vectors must be in the interior of D. It follows that the interior of D cannot be empty. 

We conclude by noting that this argument actually establishes the more general statement, that the complement 
of any open KS-colourable set must have non-empty interior. 

Lemma 3 There is a fixed, model-independent, non-zero lower bound on the solid angle subtended by D 

Let us note that it already follows from Lemma 2 that the solid angle subtended by D must be > 0, for any given 
colouring. Lemma 3 implies a stronger statement: that one cannot, by choosing the colouring appropriately, make 
the solid angle arbitrarily small. 

We begin by establishing some notation. Let ijl be the invariant measure on S 2 , normalized so that n(S 2 ) — 1 (i.e. 
the solid angle scaled by l/(47r)). Let O (respectively B) be the set of all open (respectively Borel, or /^-measurable) 
subsets of S 2 which are KS-colourable. Define 

d = 1 - sup (fi(U)) (Bl) 
ueo 

d B = l- sup (/!(£?)) (B2) 

Clearly, do > drs (it is a non-trivial question as to whether do actually = d&). 

We will show that ds > 0. In view of the fact that C G O and D = S 2 — C this implies 
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H(D) >d o >d B >0 (B3) 

which establishes the claim. 

Before proceeding to the proof, let us note that it follows from the last paragraph in the proof of Lemma 2 that 
H{U) > for all U £ O. The proof which follows is needed to show that there is not a sequence of sets U n £ O such 
that (J,(U n ) — ► 0. This may appear intuitively obvious. However, the fact that there exist dense KS-colourable subsets 
of S 2 was intuitively surprising (at least to the present writer), and so it behoves us to be careful. 

We now proceed to the proof that ds > 0. 

Choose some fixed, finite, KS-uncolourable set {ni,ri2, . . . , 112M} C S 2 , with the property that riM+r = n r for 
r = l,...,M. 

Let 9q be the minimum angular separation of the vectors belonging to this set: 

On = min (cos (n r ■ n s )) (B4) 

l<r,s<M y ' 

Surround each vector n r with a circular patch E r of radius #o/2: 

E r = {m £ S 2 : cos _1 (m • n r ) < 6 /2} (B5) 

Let B be any set £ B, and let B c = S 2 — B be its complement. We may assume, without loss of generality, 
that B is invariant under the parity operation (implying that B c is also invariant). By construction, the sets E r are 
non-intersecting, with the possible exception of a set of measure zero on their boundaries. Consequently 

2M 

^B c )>J2KE r nB c ) (B6) 

r-l 

We next define, for each r, a function g r : S 2 — > E r by 

g r (m) = e (e ° /2)m - L n r (B7) 

where Li,L2,L^ are the generators of SO(3). Thus, g r (m) is the vector obtained by rotating n r through a (fixed) 
angle 9o/2 about the (variable) axis m. It is easy to sec that, as m ranges over the whole of S 2 , the interior of E r is 
covered twice, and the boundary once. Consequently 

fi(E r nB c ) = l ( J r (m) d(i (B8) 
2 J bc 

where J r is the Jacobian of g r , and B^ = g~ 1 (E r n B c ). Define, for each m £ S 2 , 

J(m) = min (J r (m)) (B9) 



Eqs. (g6|) and (|B8|) then imply 

K B 1 > ~ \ I J(m)d f i+ [ J(m)dA (BIO) 

We now observe that, for each fixed m £ S* 2 , the set {gi(m), 52(111), . . . , giM ( m )}) being obtained by rotating the 
KS-uncolourable set {n!,n 2 , . . .n 2 j\/}, must itself be KS-uncolourable. On the other hand, B is KS-colourable. It 
follows that, for each m £ S 2 , we must have <7 r (m) £ B c and <?M+r( m ) £ B c for some r < M. Consequently 
^ =1 B c r = U 2 r A J M+1 B c r = S 2 , and 

KB C ) > J J(m)d^i (Bll) 
Since the right-hand side of this inequality is independent of B we may deduce 



sup(/x(B)) <1- / J(m)dn (B12) 

To complete the proof we note that J(m) is a continuous, non-negative function having finitely many (in fact 2M) 
zeros. It follows that 

ds > J J(m) dfj, > (B13) 
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Concluding Remarks 



It is natural to consider, in addition to the sets O and B defined above, the set C, consisting of all closed, KS- 
colourable subsets of S 2 . Define dc = 1 — sup^ gC (/i(if)). Clearly, dc > d$. It is a standard result (see, for 
example, Halmos [0, Chap. 10) that, for each Borel set -B, there exists a sequence of closed sets K n C B such that 
n(B) = lim JWOO (fi(K„)) . It follows that we also have dc < dg. Combining these inequalities we deduce 

d >d c = d B (B14) 

The numbers do and dc — ds could be regarded as non-classicality indices (the larger they are, the more radically 
quantum mechanics departs from classical assumptions). From this point of view it would be interesting to know 
their actual values. 



The evaluation of the integral on the right-hand side of Inequality ( B13 ) , though s trai ghtforward, would be somewhat 



tedious. We will therefore content ourselves with noting that it follows from Eq. ( ]B9| ) that 

J J(m)d/x < J J r {m)dn = 2 J dfi = 2sin 2 (^j (B15) 

where r is any integer in the range 1, . . . , 2M, and where 9q is the minimum angular separation of the vectors in the 
KS-uncolourable set considered. In the case of the Conway-Kochen set (||J] one has 8p = 18.4° (the angle between 
the vectors (0, 1, 2) and (0, 2, 2)), implying that the right-hand side of Inequality ( B13| ) i s < .013. 



One would like to know if do is of the same order as the lower bound set by Inequality ( B13 ) — implying that > 99% 
of the area of S 2 can be covered with an open KS-colourable subset — or whether the actual value is significantly larger. 
This is a question which it might be interesting to investigate further. It might also be interesting to investigate the 
analogous quantities defined on Hilbert spaces of dimension > 3. In particular, it might be interesting to investigate 
the way in which these quantities vary with the dimension of the Hilbert space. 



APPENDIX C: THE "OPERATIONAL" APPROACH OF SIMON ET AL AND LARSSON 

Simon et al jl7|,[l9) and Larsson |L8| (SBZL) claim to have proved an operational generalization of the KS theorem, 
applying to real experiments (Larsson, however, does not use the word "operational"). Their result inspired the 
generalized KS theorem proved in Section [v| of this paper. It also inspired part of the argument in Section VII 



However, although we are very significantly indebted to SBZL, we also have some significant criticisms. We will 
argue, in fact, that their operational version of the theorem is invalid. 

In the following we begin by criticizing SBZL's operational definition of contextuality. We go on to identify a 
loophole in SBZL's argument. We conclude by exhibiting a model which gets through the loophole, and which 
provides a concrete counter-example to their operational version of the generalized KS theorem. 

SBZL consider an instrument which performs measurements of each of the N triads contained in some fixed KS 
uncolourable set. The triad to be measured is determined by three switches. They assume that the system+instrument 
combination is fully specified by: (1) the three switch positions ei,e2,e3, (2) the system hidden variables As, (3) the 
instrument hidden variables Ai (Larsson, however, denotes the pair (As,Ai) by the single symbol A). In general, the 
r th pointer reading is given by a function X r (ei, e2, e3, As, Ai) which depends on all three switch positions, as well as 
the hidden variables. SBZL take it that a theory is non-contextual in an operational sense if and only if there exists, 
for each measuring instrument, a single function X such that 

AT r (ei,e 2 ,e 3 , A s , Ai) = X(e r ,X s ,Xi) (CI) 

for r = 1,2,3. 

This definition has the following features: (1) if the instrument is different then the function X may be different 
and (2) X depends on the instrument hidden variables Ai, as well as the system hidden variables As- For these reasons 
it is not a definition of non-contextuality in the ordinary sense. 

A non-contextual theory (in the ordinary sense) is a theory in which measurements reveal the pre-existing values of 
the observables measured. It is, in other words, a classical theory, at least so far as the concept of measurement is con- 
cerned. Non-contextuality in SBZL's operational sense, by contrast, does not have the implication that measurements 
reveal pre-existing values (of the observables measured). It is not a signature of classicality. In fact, we will show 
that it is not even inconsistent with the empirical predictions of quantum mechanics. It is therefore inappropriate. At 
least, it is inappropriate if the aim is to identify hypotheses which the empirically verifiable predictions of quantum 
mechanics exclude. 
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Before proceeding further it will be convenient to digress, and consider the ordinary concept of non-contextuality 
in a little more detail. 

The context of a measurement is the particular procedure used to carry out the measurement. Alternatively, it 
is what Bohr EJ describes as the "whole experimental arrangement" , and what Bell [§] describes as the "complete 
disposition of the apparatus" . The term "contextuality" or "context-dependence" refers to the phenomenon in which 
different procedures for measuring the same observable have different outcomes, even though the hidden state of the 
system is initially the same. 

A theory in which measurements reveal pre-existing values (of the observables measured) is obviously guaranteed 
to be non-contextual. The converse is also true: if a theory is non-contextual, then measurements reveal pre-existing 
values (of the observables measured) . It may perhaps happen that pre-existing values are not explicitly mentioned, in 
the formal description of a non-contextual theory. Nevertheless, they must be present implicitly (since, if the theory 
predicts that a measurement of A will always produce the value a, then that effectively means the theory assigns to 
A the value a). To say that a theory is non-contextual is logically equivalent to saying that it is a theory in which 
measurements reveal pre-existing values (of the observables measured). 

There is a tendency, in formal discussions, to focus on one particular kind of contextuality: namely, the kind where 
the result of measuring A depends on which other commuting observables B,C, . . . are jointly measured along with 
A. It is an important kind of contextuality because it lends itself to the proof of general theorems. However, there 
are many other kinds. 

A difference in measurement context is any difference in the detailed specification of the apparatus used for per- 
forming the measurement. It need not be a major difference. Suppose, for example, that the shape of the pole pieces 
in a Stern-Gerlach arrangement was slightly changed. This would count as a change of context. A real laboratory 
instrument admits numerous minor variations on the basic design. A theory is not genuinely non-contextual unless 
measurem ent outcomes are insensitive to every such variation. 

In Eq. ( |Cl| ) the function X depends on the instrument, and on Ai. It follows that SBZL's operational criterion 
does not imply that measurement outcomes are genuinely independent of context. SBZL tend to focus on the switch 
positions and pointer readings as the features of the measuring apparatus of which the experimenter has knowledge 



("the switch position is all he knows about", as it is expressed in ref. |17|). However a real measuring apparatus has 
many other controllable parameters: for example, the number of turns in each coil, the exact position of each screw, 
the pressure in each evacuated chamber, etc. etc. A small change in one of these parameters will be reflected by a 
change in the probability distribution on the variables Ai, if not by a change in the function X itself. Consequently, 
it may have a large affect on the measurement outcome, even in a theory satisfying SBZL's operational criterion. 

SBZL's operational criterion is, in one way, too weak. Not only does their criterion fail to exclude models which are 
highly non-classical. It is not even inconsistent with the empirical predictions of quantum mechanics (see the example 
at the end of this appendix). 

Their criterion is, in another way, too strong. We stated earlier that a non-contextual theory is one in which mea- 
surements reveal pre-existing values (of the observables measured). However, this only applies to ideal measurements. 
Non-ideal measurements do not certainly reveal exact pre-existing values, and so they may only be approximately 
independent of context. SBZL's criterion is meant to apply to real laboratory measurements. It is therefore unduly 
restrictive in requiring that, for given As, Ai, measurements of ei in the contexts {ei, 62,63} and {ei,e 2 ,e 3 } must, 
with certainty, have the same outcome (this point is closely related to the point we made at the end of Section 
when we argued that the condition iVe q < 1 is not sufficient to ensure a contradi ctio n with classical assumptions). 



In short, the criterion for a theory to be genuinely non-contextual is, not Eq. (CI), but 



X r (ei,e 2 ,e 3 , A s , Ai) = /(e r ,A s ) (C2) 

for all As, Ai in a set of probability measure rs 1, and for some function / which (unlike X and X r ) is the same for all 
measuring instruments. It will be seen that this is, in effect, the criterion we used in Section |v|. 
Let us now consider SBZL's argument. 

SBZL consider a set of N switch settings (e\ , e\ , ) , . . . , (e^ , , ) which cannot consistently be coloured. For 
each r, let A r be the set of hidden variables (As, A/) such that 

X(e{, A s , A,) + X(e r 2 , A s , A,) + X(e r 3 , X s , Xi) ± 2 (C3) 



where X is the function defined by Eq. (CI) above. 

Let As and Ai be the hidden state spaces of system and instrument respectively, and let [i be the probability 
measure on A5 x A/ describing the system+instrument combination. SBZL make the crucial assumption, that /i is 
the same for all switch settings. 

It follows from the definitions that fi(A r ) < e q for all r (where e q is the quantity which SBZL denote e, and which 
is defined by Eq. (Ill) of this paper). In view of the fact that U^ =1 A r = As x Aj this implies 
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(N \ N 

(J A- I <J2K A r)<Ne q 
r=l / r=l 



(C4) 



SBZL conclude that theories which are non-contextual in their operational sense cannot reproduce the quantum 
mechanical predictions for an apparatus with e q < l/N. 

The problem with this argument is the fact that it rests on the assumption that n is the same for all switch 
settings. The assumption is not justified. Changing the switch settings changes the quantum state of the instrument. 
In a hidden variables theory the quantum state of the instrument is represented by a probability measure on S x Ai 
(where E is the space of switch settings) . It is reasonable to assume that this probability measure will be of the form 
5 X v, where S is a (5-function measure on S. However, there is no reason to assume that v is the same for all switch 
settings — which is what would be needed if SBZL's assumption was to be justified. 

To find an example of a hidden variables theory to which SBZL's assumption is inapplicable, one does not need to 
look further than the Bohm theory. In the Bohm theory the probability distribution is given by the square of the 
modulus of the wave function, in the x-spacc representation. Changing the switch settings changes the wave function, 
and so it will, in general, change the probability distribution. 

It is not simply that SBZL's argument is not sufficient to establish their conclusion. Their conclusion is not true — as 
we now show, by means of a counter-example. The example we choose is a mo del of MKC type. It has the property 
e q = 0. The model is not non-contextual in the ordinary sense, defined by Eq. ( |C2| ). However, it is non-contextual in 
SBZL's operational sense, defined by Eq. (|Cl|). 

Let S 2 be the unit 2-sphere, and let S^be the dense, KS-colourable subset on which the MKC valuations are 
defined. For each As, let / As : Sq — > {0, 1} be the associated valuation. 

If the hidden state of the instrument is Ai the measurement reveals the pre-existing values of a triad ti(Ai) = 
{en(Ai), ei2(Ai), ei3(Ai)} C Sq. The probability measure on Ai is such that ti(Ai) is always extremely close to the 
triad of switch settings (this means that the probability measure must be strongly dependent on the switch settings — 
contrary to what is assumed by SBZL). 

Given any vector n 6 S 2 let s(n, Ai) be the index s for which the scalar product ei s (Ai) • n is maximal (ambiguities 
being resolved by taking the smallest such index). Then the r th pointer reading is given by 

Ar r (ei,e 2 ,e 3 ,A s , Ai) = / As (e Is(eViAl) (Ai)) (C5) 

Comparing this equation with Eq. ( pl| ) it can be seen that the model is non-contextual in SBZL's operational sense 
(with X(n, As, Ai) = / As (ei s ( njAl ) (Ai)]). On the other hand the fact that f\ s is a KS-colouring implies e q = 0. 

The example thus shows that SBZL's operational version of the generalized KS th eore m is invalid. 

Of course, the model is non-contextual in the ordinary sense, as defined by Eq. (|C^). Consequently, there is no 
contradiction with the result proved in Section that theories which are non-contextual in the ordinary sense cannot 
reproduce the quantum mechanical predictions for an apparatus with e q <C l/N. 

The model actually shows that non-contextuality in SBZL's operational sense is not inconsistent with the empirical 
predictions of quantum mechanics, even as regards the outcome of measurements for which e q = 0. Consequently, it 
also provides some additional grounds for considering their operational criterion to be inappropriate. 



APPENDIX D: CONTEXTUALITY OF SEQUENTIAL ANALYZERS 

The purpose of this appendix is to prove Inequality (|l0|), establishing that the observable whose value is revealed 
by a near-ideal analyzer must depend on context. 

Let f\ be the KS-colouring of Sq defined by the hidden state A. Define an induced valuation fx of the full unit 
2-sphere S 2 by 

f ^ _ / if G S 2 : / A (n') = 1}) < 0.5 

tx[Ii) ~ \ 1 if G S 2 : / A (n') - 1}) > 0.5 

for all n. The significance of this construction is that, if the hidden state is A, then there is probability > 0.5 that a 
measurement of n will have outcome / A (n). 

We now make the assumption, that the behaviour of the analyzers is independent of context. 

Let r be any orthonormal triad which / A -evaluates to one of the "illegal" combinations 000,011, 101, 110, 111. We 
will show that, if r is measured in the hidden state A, there is probability > 0.5 that the result will be "illegal". 

In fact, suppose that the / A -values of r are 000. Let p r be the probability that the result of measuring e r is 0. The 
defintion of fx implies that p r > 0.5 for each r. The assumption of context-independence implies that the probabilities 
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are independent. Consequently, the probability that the measurement outcome will be one of the "legal" combinations 
001, 010, 100 is 

Plogal =Plf>2(l -P3) +PlP3(l-P2) + P2P3^ ~ Pi) (D2) 

It is straightforward to verify that the expression on the right hand side is always < 0.5. A similar argument shows 
that piegai is also < 0.5 if the /^-values of r are one of the other four "illegal" combinations. We conclude that, in 
every case, there is probability > 0.5 that the measurement outcome will be "illegal" — as claimed. 

Now let T\ , . . . , Tj\r be a set of triads which cannot consistently be coloured. For each A choose an index r\ such that 
the /^-values of r rx are "illegal" . Let A be the set of all hidden states A. For each index r define A r = {A € A: r\ = r}. 
On the one hand, it follows from the result proved in the last paragraph that, if A € A r , then there is probability > 0.5 
that a measurement of r r will have an "illegal" outcome. On the other hand the sets Ai, . . . , Ajy partition A into N 
disjoint subsets, so there must be an index tq such that there is probability > 1/N that A 6 A ro . Combining these 
statements we deduce that there is probability > 1/(2N) that a measurement of r ro will have an "illegal" outcome. 
However, it follows from the definition that this probability must be < e q . 

We deduce that, if the behaviour of the analyzers is context-independent, then Ne q > 1/2. 



APPENDIX E: SEQUENTIAL MEASUREMENTS ARE JOINT MEASUREMENTS 

Kent [j38| has argued that a sequential measurement, such as the one illustrated in Fig.|2[ is not a joint measurement. 
According to him a process does not count as a joint measurement unless the different observables are measured all 
together, in parallel so to speak. The purpose of this appendix is to explain why we consider this position to be 
unjustified. 

Measuring instruments are defined functionally, in terms of what they do. This is true both classically and quantum 
mechanically. If a system performs the function, of an instrument which measures A, then it is an instrument which 
measures A. 

The function of a classical measuring instrument is to reveal pre-existing values. Suppose that one has a sealed box 
with a dial and two terminals on the front, and suppose that one wishes to confirm that it is a classical ammeter. This 
question can be decided without looking inside the box. It is sufficient to test the instrument with known currents, 
and check that it reads correctly. Anything which does the job of a classical ammeter, is a classical ammeter. 

A micrometer screw guage and an interferometer look very different. However, they both count as length measuring 
instruments. This is because they both do the job of a length measuring instrument. 

Similar considerations apply in quantum mechanics. The function of a quantum measuring instrument is to dis- 
criminate eigenstates. Suppose one has a sealed box, with a hole into which spin 1 particles can be injected, and a 
digital readout. Suppose it is found, after testing, that the box correctly identifies the joint eigenstates of (ei • S) 2 , 
(e2 • S) 2 , (e3 • S) 2 . Then that is enough to establish that it jointly measures these observables. 

Anything which discriminates the joint eigenstates of (ei • S) 2 , (ea • S) 2 , (e3 • S) 2 , jointly measures these observables. 
In particular, the apparatus illustrated in Fig. || jointly measures them. 

A quantum measurement can always be conceived in "black-box" terms. In order to decide whether a process 
is a quantum measurement of a given set of commuting observables it is only necessary to consider the state of 
system+apparatus at the time when the measurement interaction commences and again, at the time when the 
interaction is concluded (the difference between these times necessarily being > 0). If the initial and final states 
are related in the required manner, then the process is a measurement of the observables concerned. Questions as to 
the detailed behaviour of the state between these times, during the course of the interaction, are irrelevant. 
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